Explained variation for logistic regression.
نویسندگان
چکیده
Different measures of the proportion of variation in a dependent variable explained by covariates are reported by different standard programs for logistic regression. We review twelve measures that have been suggested or might be useful to measure explained variation in logistic regression models. The definitions and properties of these measures are discussed and their performance is compared in an empirical study. Two of the measures (squared Pearson correlation between the binary outcome and the predictor, and the proportional reduction of squared Pearson residuals by the use of covariates) give almost identical results, agree very well with the multiple R2 of the general linear model, have an intuitively clear interpretation and perform satisfactorily in our study. For all measures the explained variation for the given sample and also the one expected in future samples can be obtained easily. For small samples an adjustment analogous to Radj2 in the general linear model is suggested. We discuss some aspects of application and recommend the routine use of a suitable measure of explained variation for logistic models.
منابع مشابه
A NEW APPROACH FOR PARAMETER ESTIMATION IN FUZZY LOGISTIC REGRESSION
Logistic regression analysis is used to model categorical dependent variable. It is usually used in social sciences and clinical research. Human thoughts and disease diagnosis in clinical research contain vagueness. This situation leads researchers to combine fuzzy set and statistical theories. Fuzzy logistic regression analysis is one of the outcomes of this combination and it is used in situa...
متن کاملبهکارگیری متغیرهای پنهان در مدل رگرسیون لجستیک برای حذف اثر همخطی چندگانه در تحلیل برخی عوامل مرتبط با سرطان پستان
Background and Objectives: Logistic regression is one of the most widely used generalized linear models for analysis of the relationships between one or more explanatory variables and a categorical response. Strong correlations among explanatory variables (multicollinearity) reduce the efficiency of model to a considerable degree. In this study we used latent variables to reduce the effects of ...
متن کاملComparing the importance of prognostic factors in Cox and logistic regression using SAS
Two SAS macro programs are presented that evaluate the relative importance of prognostic factors in the proportional hazards regression model and in the logistic regression model. The importance of a prognostic factor is quantified by the proportion of variation in the outcome attributable to this factor. For proportional hazards regression, the program %RELIMPCR uses the recently proposed meas...
متن کاملStudy of factors affecting on neonatal hyperbilirubinemia according of optimization in logistic regression model.
Aim and Back ground: Neonatal hyperbilirubinemia is most common reason to re-admission in hospital. The aim of this study is to investigate effect of risk factors such as hypertension, age and type of delivery in mothers on neonatal hyperbilirubinemia based on logistic regression model. Method and material: In this descriptive study, the 300 mother's documents which refer to hospital for hospit...
متن کاملVariation analysis of wheat F3 lines produced by crossing between Azar2 and 87-Zhong291 cultivars using RAPD method in drought stress condition
In order to analyze variation and find an efficient selection approach, we used crossing results between two wheat cultivars, Azar2 and 87-Zhong291. F3 plants (374 lines) and four check cultivars planted in the form of augmented experiment. For germination, seeds were irrigated once. After using multiple statistics methods such as cluster analysis, 79 different lines were selected and used for ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Statistics in medicine
دوره 15 19 شماره
صفحات -
تاریخ انتشار 1996